🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
👂 Psychoacoustic Coding

Perceptual Audio, Masking Models, Hearing Science, Lossy Compression

JSQA: Speech Quality Assessment with Perceptually-Inspired Contrastive Pretraining Based on JND Audio Pairs
arxiv.org·8h
🎧Learned Audio
Investigating claims that GPUs can unlock "limitless music production potential"
musicradar.com·19h·
Discuss: Hacker News
🎧Learned Audio
CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos
arxiv.org·8h
🎬AV1 Encoding
A Neural Net For a Graphing Calculator?
hackaday.com·4h
🤖Advanced OCR
2025-07-16: Understanding Hallucination in Large Language Models: Challenges and Opportunities
ws-dl.blogspot.com·11h·
Discuss: ws-dl.blogspot.com
✨Effect Handlers
WiSec 2025 Spotlight: Security in the Inaudible World
esat.kuleuven.be·22h
📡Bluetooth Archaeology
A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
arxiv.org·8h
🎧Vorbis Encoding
Mitigating Object Hallucinations via Sentence-Level Early Intervention
arxiv.org·8h
👁️Perceptual Coding
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
arxiv.org·2d
🎵Audio ML
Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition
arxiv.org·2d
📊Spectrograms
Boffins detail new algorithms to losslessly boost AI perf by up to 2.8x
theregister.com·2h·
Discuss: Hacker News
💻Local LLMs
Measuring and predicting visual fidelity
arxiv.org·8h
🌈Color Science
Deep Neural Encoder-Decoder Model to Relate fMRI Brain Activity with Naturalistic Stimuli
arxiv.org·8h
🧠Neural Codecs
Physics-Informed Transfer Learning for Data-Driven Sound Source Reconstruction in Near-Field Acoustic Holography
arxiv.org·1d
🎧Learned Audio
A Minimal DDPM
github.com·1d·
Discuss: Hacker News
🧠Machine Learning
Intel releases new tool to measure gaming image quality in real time —AI tool measures impact of upscalers, frame gen, others; Computer Graphics Video Quality M...
tomshardware.com·23h
📊Rate-Distortion Theory
ASMR AI: Generate AI ASMR Videos with High Quality ASMR Voice
asmrai.net·6h·
Discuss: Hacker News
🎙️Whisper
Mastering Dimensionality Reduction: A Comprehensive Guide to PCA, t-SNE, UMAP, and Autoencoders
dev.to·3h·
Discuss: DEV
📐Vector Dimensionality
Towards Spatial Audio Understanding via Question Answering
arxiv.org·2d
🎵Audio ML
EME-TTS: Unlocking the Emphasis and Emotion Link in Speech Synthesis
arxiv.org·8h
⚙️Compression Benchmarking
Loading...Loading more...
AboutBlogChangelogRoadmap